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Abstract. Selection comparator networks have been studied for many 
years. Recently, they have been successfully applied to encode cardinal¬ 
ity constraints for SAT-solvers. To decrease the size of generated formula 
there is a need for constructions of selection networks that can be effi¬ 
ciently generated and produce networks of small sizes for the practical 
range of their two parameters: n - the number of inputs (boolean vari¬ 
ables) and k the number of selected items (a cardinality bound). In 
this paper we give and analyze a new construction of smaller selection 
networks that are based on the pairwise selection networks introduced 
by Codish and Zanon-Ivry. We prove also that standard encodings of 
cardinality constraints with selection networks preserve arc-consistency. 


1 Introduction 

Comparator networks are probably the simplest data-oblivious model for sorting- 
related algorithms. The most popular construction is due to Batcher [3] and it’s 
called odd-even sorting network. For all practical values, this is the best known 
sorting network. However, in 1992 Parberry [S] introduced the serious competitor 
to Batcher’s construction, called pairwise sorting network. In context of sorting, 
pairwise network is not better than odd-even network, in fact it has been proven 
that they have exactly the same size and depth. As Parberry said himself: ”It is 
the first sorting network to be competitive with the odd-even sort for all values 
of n There is a more sophisticated relation between both types of network 
and their close resemblance. For overview of sorting networks, see Knuth |7i or 
Parberry [8]. 

In recent years new applications for sorting networks have been found, for 
example in encoding of pseudo boolean constraints and cardinality constraints for 
SAT-solvers. Cardinality constraints take the form X\ +X 2 + • • • + x n ~ k, where 
Xi, X 2 , ■ ■ ■, x n are boolean variables, A: is a natural number, and ~ is a relation 
from the set {=, <, <, >, >}. Cardinality constraints are used in many applica¬ 
tions, the significant one worth mentioning arise in SAT-solvers. Using cardinal¬ 
ity constraints with cooperation of SAT-solvers we can handle many practical 
problems that are proven to be hard. Works of Asm et al. m describe how 
to use odd-even sorting network to encode cardinality constraints into boolean 
formulas. In [6] authors do the same with pseudo boolean constraints. 
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It has already been observed that using selection networks instead of sorting 
networks is more efficient for the encoding of cardinality constraints. Codish and 
Zazon-Ivry jT] introduced pairwise cardinality networks, which are networks de¬ 
rived from pairwise sorting networks that express cardinality constraints. Two 
years later, same authors 0 reformulated the definition of pairwise selection net¬ 
works and proved that their sizes are never worse than the sizes of corresponding 
odd-even selection networks. To show the difference they plotted it for selected 
values of n and k. 

In this paper we give a new construction of smaller selection networks that 
are based on the pairwise selection ones and we prove that the construction is 
correct. We estimate also the size of our networks and compute the difference in 
sizes between our selection networks and the corresponding pairwise ones. The 
difference can be as big as n log n/2 for k = n/2. Finally, we analyze the standard 
3(6)-clause encoding of a comparator and prove that such CNF encoding of 
any selection network preserves arc-consistency with respect to a corresponding 
cardinality constraint. 

The rest of the paper is organized in the following way: in Section 2 we give 
definitions and notations used in this paper. In Section 3 we recall the definition 
of pairwise selection networks and define auxiliary bitonic selection networks 
that we will use to estimate the sizes of our networks. In Section 4 we present 
the construction of our selection networks and prove its correctness. In Section 
5 we analyze the sizes of the networks and, finally, in Section 6 we examine the 
arc-consistency of selection networks. 

2 Preliminaries 

In this section we will introduce definitions and notations used in the rest of the 
paper. 

Definition 1 (input sequence). Input sequence of length n is a sequence of 
natural numbers x = (x \,..., x n ), where Xi £ N (for all i = l..n). We say that 
x £ N™ is sorted if Xi > Xi+± (for each i = l..n — 1). Given x = (x \,..., x n ), 
y = (yi ,..., y n ) we define concatenation as x :: y = (x ±,..., x n , y i, • ■ ■, y n )- 
We will use the following functions from N ra to N™/ 2 : 

left(x) = (aq,.. .,x n/2 ), right(x) = {x n/2+ i, ■ ..,x n ) 

Let n, to £ N. We define a relation V ’ on N” x N m . Let x = (xi ,..., x n ) and 
V = {yi, ■ ■ ■,Vm), then: 

% y 1 k 'Ll yj 

Definition 2 (comparator). Let x £ N n and leti,j £ N, where 1 < i < j < n. 
A comparator is a function Cij defined as: 


c i,j(x) = y <*=> yi = max{rj, Xj} A yj = mirn^, Xj} A x k = y k 
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Definition 3 (comparator network). We say that f n : N ra — ► N™ is a com¬ 
parator network of order n, if it can he represented as the composition of finite 
number of comparators , namely, f n = j 1 o • • • o Cj fc j k . The size of comparator 
network (number of comparators) is denoted by |/”|. Comparator network of size 
0 is denoted by id n . 

Traditionally comparator networks are presented as 
circuits that receives n inputs and permutate them us¬ 
ing comparators connected by ’’wires”. Each comparator x i 
has two inputs and two outputs. The ’’upper” output is 
the maximum of inputs, and ’’lower” one is minimum. As * 2 
an example look at Figure [T] where we present a com¬ 
parator network of order 4, max 4 , that outputs maxi- 3 
mum from 4 inputs on its first output, namely, y\ = ^ 

max{;ri, Xi, £ 3 , X 4 }. It is well known that \max n \ = n— 1. 

We will often omit explicit declaration of order of com¬ 
parator network when it is not ambiguous. 

Definition 4 (bitonic sequence). A sequence x £ is a bitonic sequence if 
Xi < .. • < Xi > ... > x n for some i, where 1 < i < n, or a circular shift of such 
sequence. We distinguish a special case of a bitonic sequence: 

— v-shaped, if x\ >...> x^ <...< x n 

and among v-shaped sequences there are two special cases: 

— nondecreasing, if x 1 < ■ ■ ■ < x n , 

— nonincreasing, if x\> .. .> x n . 


-- Vi 

- Vi 

- y 3 

- Vi 

Fig. 1 


Definition 5 (sorting network). A comparator network f n is a sorting net¬ 
work, if for each x £ N ra , f n {x) is sorted. 

Two types of sorting networks are of interest to us: odd-even and pairwise. 
Based on their ideas, Knuth 0 (for odd-even network) and Codish and Zazon- 
Ivry 0 (for pairwise network) showed how to transform them into selection 
networks (we name them oe sel r ( and pwsel £ respectively). 

Definition 6 (top k sorted sequence). A sequence x £ N” is top k sorted, 
with k < n, if (x\,..., Xk) is sorted and {xi ,..., Xk) (xk+ 1 , • ■ ■, x n ). 

Definition 7 (selection network). A comparator network ff (where k < n) 
is a selection network, if for each x £ N n , f]f{x) is top k sorted. 

To simplify the presentation we assume that n and k are powers of 2. 

A clause is a disjunction of literals (boolean variables x or their negation 
*x ). A CNF formula is a conjunction of one or more clauses. 

A unit propagation (UP) is a process, that for given CNF formula, clauses are 
sought in which all literals but one are false (say l) and l is undefined (initially 
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only clauses of size one satisfy this condition). This literal l is set to true and 
the process is iterated until reaching a fix point. 

Cardinality constraints are of the form X\ + ... + x n ~ k , where k £ N 
and ~ belongs to {<,<,=, >, >}. We will focus on cardinality constraints with 
less-than relation, i.e. X\ + ... + x n < k. An encoding (a CNF formula) of such 
constraint preserves arc-consistency, if as soon as k — 1 variables among the xf s 
become true, the unit propagation sets all other xf s to false. 

In Sj authors are using sorting networks for an encoding of cardinality con¬ 
straints, where inputs and outputs of a comparator are boolean variables and 
comparators are encoded as a CNF formula. In addition, the fc-th greatest out¬ 
put variable yk of the network is forced to be 0 by adding ~^yk as a clause to 
the formula that encodes X\ + ... + x n < k. They showed that the encoding 
preserves arc-consistency. 

A single comparator can be translated to a CNF formula in the following way: 
let a and b be variables denoting upper and lower inputs of the comparator, and 
c and d be variables denoting upper and lower outputs of a comparator, then: 

fcomp(ci, 6, c, d) <t=> (c <t=> a V b) A (d <t=> a A b) 

is the full encoding of a comparator. Notice that it consists of 6 clauses. Let / 
be a comparator network. Full encoding <f> of / is a conjunction of full encoding 
of every comparator of /. 

In [2j authors observe that in case of ~ being < or <, it is sufficient to use 
only 3 clauses for a single comparator, namely: 

hcomp(a 7 b, c, d) <t=> (a => c) A (b => c) A (a A b => d) (1) 

(cl) (c2) (c3) 

We call it: half encoding. In [2] it is used to translate odd-even sorting network 
to encoding that preserves arc-consistency. We show a more general result (with 
respect to both [B] and ®), that half encoding of any selection network preserves 
arc-consistency for the ”<” and ”<” relations. Similar results can be proved for 
the ”=” relation using the full encoding of comparators and for the ”>” or 
”>” relations using an encoding symmetric to hcomp(a,b,c,d), namely: ( d => 
a) A (d => b) A (c => a V b ). 

3 Pairwise and bitonic selection networks 

Now we present two constructions for selection networks. First, we recall the 
definition of pairwise selection networks by Codish and Zazon-Ivry |5]. Secondly, 
we give the auxiliary construction of a bitonic selection network bitsel that 
we will use to estimate the sizes of our improved pairwise selection network in 
Section 5. 

Definition 8 (domination), x £ N n dominates y £ N n if Xi > y-, (for i = 

l..n). 
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Definition 9 (splitter). A comparator network f n is a splitter if for any se¬ 
quence x £ N n , ify=f n (x), then left(y) dominates rightify). 

Observation 1 . We can construct splitter split 71 by joining inputs (i,n/2 + i) , 
for i = l..n/2, with a comparator. Size of a splitter is \split n \ = n/ 2. 

Lemma 1. If b £ N" is bitonic and y = split n (b), then left{y) and rightly) 
are bitonic and left(y) right(y). 

Proof. See Appendix B of [3]. □ 

Network 1 ( pw.sel see jSj). Input: any x £ N n . 

1. If k = 1, return max n {x). 

2. If k = n, return oesort n {x). 

3. Compute y = split(x). 

4. Compute l = pw-sef^ 2 (y) and f = pwsef^iy)■ 

5. Compute pwjmerge^(l :: f). 


Notice that since we introduced a splitter as the third step, in the recursive 
calls we need to select k top elements from the first half of y, but only k /2 
elements from the second half. The reason: r k / 2 +i cannot be one of the first k 
largest elements of l :: f. First, rk/ 2+1 is smaller than any one of (ri,... ^Tk/f) 
(by the definition of top k sorted sequence), and second, (li,... ,l k / 2 ) domi¬ 
nates (r 1 ,..., rfc/ 2 )> so r k / 2 +i is smaller than any one of (Zi,..., l k / 2 )- From this 
argument we make the following observation: 

Observation 2. If l £ N"/ 2 is top k sorted, f £ N"/ 2 is top k/2 sorted and 
(Zi, - - -, ifc/ 2 ) dominates (r \,..., r k / 2 )i then k largest elements of l :: f are in 
(h,...,l k ) :: (ri,...,r fe/2 ). 

The last step of Network [l] merges k top elements from l and k/2 top elements 
from f with so called pairwise merger. We will omit the construction of this 
merger, because it is not relevant to our work. We would only like to note, that 
its size is: \pwjmerge k \ = fclogfc — k + 1. Construction of the merger as well as 
the detailed proof of correctness of network pw.sel jj? can be found in Section 6 

of El- 

Definition 10 (bitonic splitter). A comparator network f n is a bitonic split¬ 
ter if for any two sorted sequences x,y £ if z = bit_split n (x :: y), then 

( 1 ) left(z ) >; right(z) and ( 2 ) left(z) and right(z) are bitonic. 

Observation 3. We can construct bitonic splitter bit_split n by joining inputs 
(i,n — z + 1), for i = l..n/2, with a comparator. Size of a bitonic splitter is 
\bit_split n \ = n/ 2 . 
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We now present the procedure for construction of the bitonic selection net¬ 
work. We use the odd-even sorting network oesort and the network bitjnerge 
(also by Batcher [3]) for sorting bitonic sequences as black-boxes. As a reminder: 
bit_merge n consists of two steps, first we use y = split 71 {x ), then recursively com¬ 
pute bit_merge n / 2 for left(y) and right(y) (base case, n = 2 , consists of a single 
comparator). Size of this network is: \bitjmerge n \ = nlogn/2. 

Bitonic selection network bit_seV/ is constructed by the following procedure. 

Network 2 {bitsel 7 /). Input: any x £ N”. 

1. Let l = n/k. Partition input x into l consecutive blocks, each of size k, then 
sort each block with oesort k , obtaining B \,..., Bi. 

2. While l > 1, do the following: 

(a) Collect blocks into pairs (Bi, B 2 ), ■.., ( Bi_i , Bf). 

(b) Compute iji = bitsplit 2k (Bi :: Bi + 1 ) for each i £ {1,3,..., l — 1}. 

(c) Compute = bit_merge k (left(iji)) for each result of previous step. 

(d) Let 1 = 1/ 2. Relabel B[ to Bi, for 1 < i < l. 


Theorem 1. A comparator network bit self constructed by the procedure Net¬ 
work^ is a selection network. 

Proof. Let x £ N n be the input to bitsel After step one we get sorted sequences 
B±,..., Bi, where l = n/k. Let l m be the value of l after m iterations. Let 
B™, ..., Bff be the blocks after m iterations. We will prove by induction that: 

P(m): if Bi,..., Bi are sorted and are containing k largest elements of x, then 
after m-th iteration of the second step: l m = 1/2 m , B ™,..., Bj 71 are sorted and 
are containing k largest elements of x. 


If m = 0, then l = 1, so P(m) holds. We show that V m > 0 ( P(m ) => P{m + 1)). 
Consider (m+l)-th iteration of step two. By the induction hypothesis l m = 1/ 2 m , 
B ™,..., B™ are sorted and are containing k largest elements of x. We will show 
that (m + l)-th iteration does not remove any element from k largest elements of 
x. To see this, notice that if y, = bitsplit 2k (B™ :: Bff x ) (for i e {1,3,..., l m — 
1}), then left(yi ) >z rightly/) and that left(yi) is bitonic (by Definition 10). 
Because of those two facts, right{yi) is discarded and leftfyf) is sorted using 
bit_merge k . After this, l m +i = lm/ 2 = 1/2 m+1 and blocks B ™ +1 ,..., B/ 7 ^ are 
sorted. Thus P{m + 1) is true. 

Since l = n/k , then by P{m) we see that the second step will terminate after 
m = log ^ iterations and that B i is sorted and contains k largest elements of 
x. □ 


Schema of construction of bitonic selection network is shown in Figure[2] The 
size of bitonic selection network is: 
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/\\/ /\\//i 


oe sort k 


bit_split 2k 

bit_merge k 

bit_split 2k 

bit_merge k 


Fig. 2: Bitonic selection network - schema of construction 


\bitsel%\ = j\oesort k \ + (7 ~ l) (\bit-split 2k \ + \bit.merge k \) 

rZ \ /C / 

11 In 

= -nlog 2 k + -nlogfc + 2 n — -fclogfc — k — — 

4 4 2 k 

In Figure [3] we present bitonic and pairwise selection networks for n = 
k = 2 . 


( 2 ) 
8 and 


(a) 


(b) 


Fig. 3: a) bitonic selection network; b) pairwise selection network; n = 8 , k = 2. 
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4 New Smaller Selection Networks 

As mentioned in the previous section, only the first kj 2 elements from the second 
half of the input are relevant when we get to the merging step in pwsel k . We 
will exploit this fact to create a new, smaller merger. We will use the concept 
of bitonic sequences, therefore the new merger will be called pwJ>it-merge k and 
the new selection network: pwJbitsel k . The network pw-bitsel k is generated 
by substituting the last step of pwsel £ with pwJbitjmerge k . The new merger 
consists of two steps: 

Network 3 ( pwJbitjmerge/J). Input: l :: f, where l £ N "/ 2 is top k sorted and 
f £ N n / 2 is top k/2 sorted and (l \,... ,4/ 2 ) dominates (fi,... ,r k / 2 ). 

1 . Compute y = bitsplit k (l k / 2 +i,.-.,h,ri,...,r k / 2 ), let b = (4, •••,4/2) ■■ 
{y 1 , * • •, Vk/ 2 )• 

2. Compute bitjmerge k (b). 


Theorem 2. The output of Network^ consists of sorted k largest elements from 
input l :: r, assuming that l £ N ra,/2 is top k sorted and r £ N ra//2 is top k/2 sorted 
and (Zi,..., l k / 2 ) dominates (ri,..., r k / 2 ). 


Proof. We have to prove two things: (1) b is bitonic and (2) b consists of k largest 
elements from l :: r. 

(1) Let j be the last index in the sequence ( k/2 + 1,..., k), for which lj > 
r k -j+ 1 - If such j does not exist, then (yi, ■ ■ ■ ,y k / 2 ) is nondecreasing, hence b 
is bitonic (nondecreasing). Assume that j exists, then (yj- k / 2 +i, ■ ■ ■ > Vkl'i) is 
nondecreasing and (y±,... ,y k -j) is nonincreasing. Adding the fact that l k / 2 > 
4 / 2+1 = 2/i proves, that b is bitonic (v-shaped). 

(2) By Observation |2j it is sufficient to prove that b A (y k / 2 +i, ■ ■ ■, y k )- 
Since V k / 2<j < k 4/2 > lj > min{4, r k - j+1 } = y 3 k/ 2 -j+i, then (4, •••,4/2) >r 
(Vk/ 2 + 1 , ■ • ■ ,2/fc) and by Definition[lO| (y ± ,..., y k/2 ) A (y k/ 2 + 1 ,y k ). Therefore 
b consists of k largest elements from l :: f. 

The bitonic merger in step 2 receives a bitonic sequence, so it outputs a 
sorted sequence, which completes the proof. □ 


The first step of improved pairwise merger is illustrated in Figure [4j We 
use k /2 comparators in the first step and fclogfc /2 comparators in the second 
step. We get a merger of size A: log k/2 + k/2 , which is better than the previous 
approach. In the following it is shown that we can do even better and eliminate 
k /2 term. 

The main observation is that the result of the first step of pwJbitjmerge oper¬ 
ation: ( 61 , b 2 ,..., b k ) is not only bitonic, but what we call v-shape s-dominating. 

Definition 11 (s-domination). A sequence b = ( b \, b 2l ..., b k ) is s-dominating 

*/Vi<j<fc / 2 bj > b k _j + 1. 
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Fig. 4: Making the bitonic sequence. Arrows on the right picture show directions 
of inequalities. Sequence on the right is v-shape s-dominating at point i. 


Lemma 2. If b = (b±, & 2 , • • •, bk) is v-shaped and s-dominating, then b is non- 
increasing or 3 k / 2 <i< k bi < b i+1 . 

Proof. Assume that b is not nonincreasing. Then 3i <j< k bj < bj+ 1 . Assume that 
j < k/ 2. Since b is v-shaped, bj+i must be in nondecreasing part of b. If follows 
that bj < bj +1 < .. • < b k /2 < ... < b k -j+\. That means that bj < b k -j+ 1 - On 
the other hand, b is s-dominating, thus bj > b k —j+i ~~ a contradiction. □ 

We will say that a sequence b is v-shape s-dominating at point i if i is the 
smallest index greater than k/2 such that bi < bi+i or i = k for a nonincreasing 
sequence. 


Lemma 3. Let b = (b\, 62 , • ■ •, b k ) be v-shape s-dominating at point i, then 
( 61 ,..., b k j 4 ) y (b k / 2 +i,..., b 3 k / 4 ). 


Proof. If b is nonincreasing, then the lemma holds. From Lemma|2j k/2 < i < k. 
If i ^> 3fc/4, then by Definition |4| bi ^ ^ b 3 k j^ ^ ^ bi, so lemma holds. 

If k /2 < i < 3fc/4, then by Definition BJ 61 > ... > bi, so { 61 ,..., b k /f) y 
{bk/ 2 + 1 , ■ ■ ■ ,h). Since bi < b i+1 < ... <b 3l 
b 3 k / 4 - By Definition If and|4j 


< 03k /4 


it suffices to prove that 6 


b k /A > ^3fe/4+l > b 3k / 4 . 


'fc/4 > 

□ 


Definition 12 (half splitter). A half splitter is a comparator network con¬ 
structed by comparing inputs (fc/4 + 1, 3fc/4 + 1),..., (k/2, k) (normal splitter 
with first fc/4 comparators removed). We will call it halfsplit k . 


Lemma 4. If b is v-shape s-dominating, then hal f _split k (b) = split k (b). 


Proof. Directly from Lemma |3] □ 

Lemma 5. Let b be v-shape s-dominating. Following statements are true: (1) 
left(half-split k (b )) is v-shape s-dominating; ( 2 ) right(half split k (b)) is bitonic; 
(3) left(half-split k (b )) y right(halfsplit k (b)). 

Proof. (1) Let y = left(halfsplit k (b)). First we show that y is v-shaped. If y is 
nonincreasing, then it is v-shaped. Otherwise, let j be the first index from the 
range {!,...,fc/2}, where yj-i < yj. Since yj = max{bj,bj +k / 2 } and yj~i > 
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bj -1 > bj , thus bj < bj + k/ 2 - Since b is v-shaped, element bj +k /2 must be in 
nondecreasing part of b. It follows that bj > ... > b k /2 and bj +k / 2 < ... < b k . 
From this we can see that Vj<j '< k /2 Uj' = ma n{bji ,bj* +k / 2 } = bj / +k / 2 , so yj < 
... < Uk/ 2 - Therefore y is v-shaped. 

Next we show that y is s-dominating. Consider any j , where 1 < j < fc/4. 
By Definition |4| and fTTj b j > bk/ 2 -j+i and bj > b k -j+ 1 , therefore yj = bj > 
max{b k / 2 -j+i7bk-j+i} = Vk/ 2 -j+ii thus proving that y is s-dominating. Con¬ 
cluding: y is v-sliape s-dominating. 

(2) Let 0 = right(hal f split k (b)). By Lemma [ 4 J z = right(split k (b)). We 
know that b is a special case of bitonic sequence, therefore using Lemma [l] we 
get that z is bitonic. 

(3) Let w = half split k (b). By Lemma |4j w = split k (b). We know that b is 

a special case of bitonic sequence, therefore using Lemma [l] we get left(w) >: 
right(w). □ 

Using half split and Batcher’s bitjmerge and successively applying Lemma 
[5] to the resulting v-shape s-dominating half of the output, we have all the tools 
needed to construct the improved pairwise merger using half splitters: 

Network 4 ( pw-hbit_merge k ). Input: l :: f, where l £ N™/ 2 is top k sorted and 
I £ N K//2 is top k/2 sorted and (Zi ,... ,l k / 2 ) dominates (r 1 ,... ,r k / 2 )- 

1. Compute y = bitsplit k (l k/2+1 ,..., l k , r x ,..., r k/2 ), let b = (l l 7 ...,l k/2 ) :: 

2. Compute half Jbitjmerge k {b): 

(a) If k = 2 , return. 

(b) Let b' = halfsplit(bi ,..., b k ). 

(c) Recursively compute V = halfjbit_merge k ^ 2 {left(b 1 )). 

(d) Compute r' = bitjmerge k / 2 (right(b')). 

(e) Return V :: r'. 


The following theorem states that the construction of pwJibitrmerge k is 
correct. 

Theorem 3. The output of Network^ consists of sorted k largest elements from 
input l :: f, assuming that l £ N"/ 2 is top k sorted and f £ N"/ 2 is top k/2 sorted 
and {h,...,lk/ 2 ) dominates (ri,..., r k / 2 ). Also \pw-hbitjmerge k \ =k log k/2. 

Proof. Since step 1 in Network [4] is the same as in Network [3j we can reuse the 
proof of Theorem [ 2 ] to deduce, that b is v-shaped and is containing k largest 
elements from l :: f. Also, since V i<j < k /2 lj A lk-j+i and lj > rj, then bj = lj > 
ma,x{l k _j + i,rj} = b k -j+i, so b is s-dominating. 

We prove by the induction on fc, that if b is v-shape s-dominating, then the 
sequence halfJbitjmerge k (b) is sorted. For the base case, consider k = 2 and a 


v-shape s-dominating sequence ( 61 , b 2 ). By Definition 11 this sequence is already 
sorted and we are done. For the induction step, consider b' = hal f split k (b). By 
Lemma [ 5 ] we get that left(b r ) is v-shape s-dominating and right(b') is bitonic. 
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Using the induction hypothesis we sort left(b') and using bitonic merger we 
sort rightfb’). By Lemma[5j leftib 1 ) right(&), which completes the proof of 
correctness. 

As mentioned in Definition 12 halfsplit k is just split k with the first k/4 
comparators removed. So half-bit jmerge k is just bit rnerge k with some of the 
comparators removed. Let’s count them: in each level of recursion step we take 
half of comparators from split k and additional one comparator from the base 
case (k = 2). We sum them together to get: 


1 + 


log k —2 

E ■ 

2=0 


^log k— 1 


2 i+2 


= > + i £ 


i -0 



Therefore we have: 


\pwJibit-merge% \ = k /2 + k log k /2 — k /2 = k log k/2 


k 

2 


□ 


The only difference between pwsel and our pwJibitsel is the use of improved 
merger pw-hbit -merge rather than pw-merge. By Theorem [3j we conclude that 
\pwjmerge^\ > \pwJibit-merge^\, so it follows that: 

Remark 1. \pw-hbitseVf\ < \pwsel%\ 


5 Sizes of new selection networks 

In this section we estimate the size of pw-kbitsel^. To this end we show that the 
size of pw-hbitsel% is upper-bounded by the size of bitsel'f and use this fact in 
our estimation. We also compute the exact difference between sizes of pwsel% 
and pwJibitsel% and show that it can be as big as nlogn/2. Finally we show 
graphically how much smaller is our selection network on practical values of n 
and k. 

We have the recursive formula for the number of comparators of pwJibitsel^: 

! \pw-hbitsel^ 2 \ + \pw-hbitsel2jl\ + 

+ \split n \ + \pw-hbit-merge k \ if k<n (3) 

|oe_sorf fc | if k = n 

\max n \ if k = 1 

Lemma 6. \pw-hbitsel%\ < \bitsel^\. 

Proof. Let auxseVf be the comparator network that is generated by substituting 
recursive calls in pw-hbitsel% by calls to bit sel r f. Size of this network (for 1 < 
k < n) is: 
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\auxsel k \ = \bitsel^ 2 \ + \ bitsel^fy + \ split 71 \ + \pwJibitjmerge k \ 
Lemma [6] follows from Lemma [7] and Lemma [8] below, where we show that 
\pwJibitsel k \ < \auxsel k \ < \bitsel k \ 

Lemma 7. For 1 < k < n (both powers of 2), \auxsel k \ < \bitsel k \. 

Proof. We compute both values from equations [2] and [4} 

i mi 1,9. 5 1, , 5 3n 

\aux-sd k \ = —n log k + -n - -klogk- -k- — 

\bit_selu\ = -nlog 2 k H —nlosk + 2 n - kiosk — k - 

i fcl 4 4 2 k 

We simplify both sides to get the following inequality: 

1 n 1 . . 

n — -k — — < -[n — k) log k 

Z rv Z 

which can be easily proved by induction. □ 

Lemma 8. For 1 < k < n (both powers of 2), \pwJibitsel k \ < \auxsel k \. 

Proof. By induction. For the base case, consider 1 = k < n. If follows by defi¬ 
nitions that \pw-hbitsel k \ = |cmx_se^| = n — 1. For the induction step assume 
that for each (?i', k') -< ( n , k ) (in lexicographical order) the lemma holds, we get: 


(4) 


0 


\pw dibits el k | 

= \pw-hbitsel(.jl\ + \pwJibit.sel^ 2 \ + \split n \ + \pw dibit jmerge k \ 

(by the definition of pw-hbitsel ) 
< \auxsel n J 2 1 + \auxseC k ~\ + \split n \ + \pw dibit rmerge k \ 


l k /2 I 

< \bitsel(Jj^\ + \bit_sel]( 
= I auxsel 


(by the induction hypothesis) 

- 1 - \split n \ + \pw dibit jmerge k \ 

(by Lemma [7]) 

(by the definition of auxsel ) 


□ 


Let N = 2 n and K = 2 k . We will compute upper bound for P(n,k) = 
\pw dibit self^ using B(n,k) = \bitsel^(\. 
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Lemma 9. Let: 


m —1 i / .\ m / \ 

P(n,k,m) Qj (0 - j)2 fe “ J “* 1 + 2 n ~ l ~ 1 ) +Y^ jP(n-m,k-i). 

Then Vo< m < m in(fc,n— k) P(n->k,rri) P(n,k). 

Proof. The lemma can be easily proved by induction on m. □ 

Lemma 10. P(n, k, m) < 2 n ~ 2 (jk - f f + k + 7 -f + 8) +2 k (|) m (f - f) - 
2 k (k + 1) - 2 n ~ k (|) m . 

Proof. First inequality below is a consequence of Lemma[9]and[6j We also use the 
following equations: J 2 k=o (k) xk ~ ljc = n (l + z) n_ \ Sl-=o (fc)& 2 = n(n + l)2 n_2 , 

ryk-ll, _ + 

Z^A;=0 x ^ — (1-x) 2 


m—1 2 


i=0 1=0 


P(n,k,m) ((k-j) 2 k J ®.+ 2" 1 x ) + ^ ( ,• ) B (. n ~ k ~ *) 


0 


i) 


_ I ofc / _ 

2 


(it + 1 " y) “ 2 *(fc + !) + m 2 ”- 1 ! 
+ 2-* + 


+ 2 fc 
= 2 n ~ 2 


- i | -2' 
2 6 1 




m 


7m 


fc - - + fc H—:—I- 8 1+2- 


fc m 
2 ~~ 6" 


-2 fc (fc + l) -2' 


n—k 


m— 1 2 / - \ 

EE • ((^ — i) 2 fe -‘ ,_1 + 2 "-* _1 ) ( 5 ) 

2 — 0 j=0 
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m— 1 i 


i —0 j —0 


m— 1 i 


EE = * 2 ‘-‘EE ,p- 2 “EEL 2 to 


2=0 J—0 


771—1 1 


2=0 j—0 


( 6 ) 



fe-1 1 


771—1 / 0 \ 2—1 


2 ' V2 

i=0 


-1 | — 2 fc_1 I 2 - | ^ 


(3 — to) 


(!) (^ + !-y)-2 fc ( fc + 1 ) 


771—1 i 


2=0 J—0 


771—1 


771—1 


■) T ~ i ~ 1 = H 2n_I ” 1 H(- = H 2n ~ i_l21 = w 2 " -1 (7) 


i=0 


3=0 


i=0 


^ . (2 n ~ m ~ 2 (k - i) 2 + 2 n ~ m - 2 (k -i)- 2 k ~ i ~ 1 (k - i ) 

*=o ^ * 2 

_ 271 — 771 — k~\~i _|_ 271 — 771+1 _ 2 k ~ i ) ( 8 ) 

771 / \ 771 / \ 771 / \ 

= f; f W ) 2" _m_2 (fc - i) 2 + 53 2 n ~ m ~ 2 {k - *) - f; ( m ) 2 k ~ i ~ 1 {k - i) 


i =0 


_ ( m ) 2 n_m_fc+i + ^ ( m ) 2 n_m+1 — ^ | 171 j 2 


i=0 

= 2 n ~ rn ~ 2 (k 2 2 m - fcrn2 m + to(to + l)2 m_2 ) + 2 n ~ m ~ 2 {k 2 m - m2"*- 1 ) 

- 2 fe_1 (fc - 2- m 3 m " 1 TO N ) - 2 n-m-fc 3 m + 2 n+1 - 2 fe ^ 


i=0 


i=0 

m 


i=0 


-%k—i 


i =0 

771—2 \ | r)71 — 77l —2 / 7.r)771 ___ rim— 1 \ 


= 2- 2 ('^-tm+ m + 1) +t + 8) 


+ 2 fc ( - ) - 1 ) - 2 


%—k 


□ 


Theorem 4. Form = min(/c,n—fc), P(n,k) <2 n 2 k — ™ ~~ |) 2 + ^ 
+2 fc (a)™ (| - f) - 2 k (k + 1) - 2 n ~ k (§) m . 


Proof. Directly from Lemmas [9| and [To! 


□ 



Smaller Selection Networks for Cardinality Constraints Encoding 


15 


We will now present the size difference SD(n, k) between pairwise selection 
network and our network. Merging step in pw.sel^ costs 2 k k — 2 k + 1 and in 
pw-hbitsel 2 k ~ 1 k, so the difference is given by the following equation: 


SD(n , k) 


0 if n = k 

0 if k = 0 

2 k ~ l k - 2 k + 1+ 

+SD(n — 1, k) + SD(n — 1, k — 1) if 0 < k < n 


(9) 


Theorem 5. Let S nM = £* =0 ( n ~ k+J )2 k -f Then: 

M fn\n +1 n-2fc + l k 

SD(n, k) = \\ — - S n>k ---2 k {k - 1) - 1 

Proof. By straightforward calculation one can verify that S n p = 1, S Uyn = 
2 n+1 - 1, S n -i,k-i = \{S n ,k - (fc)) and S n - lyk -i +S n - lyk = S n>k . It follows that 
the theorem is true for k = 0 and k = n. We prove the theorem by induction 
on pairs ( k,n ). Take any (k, n), 0 < k < n, and assume that theorem holds for 
every (fc , ,n , ) -< (fc,n) (in lexicographical order). Then we have: 


SD(n, k ) = 2 k ~ L k - 2 k + l + SD{n - 1, jfe) + SD(n - 1, k - 1) 


= 2 k ~ k k - 2 k + l + 


n — l\n 
k 


n — 1\ n 


k- 12 


— 2 k (k — 1) — 1 


- 2 k ~\k - 2 ) - 1 - ( 5 n _ lifc %^ + S n _ r.fc-r ” 2 * + 2 ) 


n \ n c « - 2fc c ofc 

k) 2 Dn ’ k 9 


2 2 
— S n -i,k-i ~ 2 k (k — 1) — 1 


n\n + l „ n — 2 k + 1 


— S’™ 


— 2 fe (fc — 1) — 1 


□ 

Corollary 1. \pwselft/ 2 \— \pwJiMtsel^^ 2 \ = A^ log ^ 4 +logiV+2, for N = 2 n . 

Plots in figure [5] show how much pwsel and the upper bound from Theorem 
[4]are worse than pwJibitsel. Lines labeled codish are plotted from (\pwsel^\ — 
\pw-hhit-Sel^l)/\pw-hbit-Selftl and the ones labeled upper are plotted from the 
formula (\upper^\ — \pwJibitsel^\)/\pwJibitsel^l, where \upper^\ is the up¬ 
per bound from Theorem [ 4 ] Both \pwsel^\ and \pwJibitsel^\ were computed 
directly from recursive formulas. We can see that we save the most number of 
comparators when k is larger than n/ 2 , nevertheless for small values of n supe¬ 
riority of our network is apparent for any k. As for the upper bound, it gives a 
good approximation of \pw-hbitsel^ \ when n is small , but for larger values of 
n it becomes less satisfactory. 










16 


Michal Karpinski, Marek Piotrow 



Fig. 5: Comparison of pairwise selection networks for pratical values of n and k. 


6 Arc-consistency of selection networks 

In this section we prove that half encoding of any selection network preserves 
arc-consistency with respect to ”less-than” cardinality constraints. The proof 
can be generalized to other types of cardinality constraints. 

We introduce the convention, that {x±,... ,x n ) will denote the input and 
(j/i,..., y n ) will denote the output of some order n comparator network. We 
would also like to view them as sequences of boolean variables, that can be set 
to either true (1), false (0) or undefined (X). 

From now on we assume that every network / is half encoded and when we 
say ’’comparator” or ’’network”, we view it in terms of CNF formulas. We denote 
V[(j){f)] to be the set of variables in encoding 

Observation 4. A single comparator hcomp(a , 6, c, d) has the following propa¬ 
gation properties: 

1 . If a = 1 or b = 1, then UP sets c = 1 f6?/[Tjcl or[T]c2 ). 

2. If a = b = 1, then UP sets c = d = 1 (7jy [ijcl andJlJcS,). 

3. If c = 0, then UP sets a = b = 0 (7)?/[l]cl and[l]c2,). 

4■ If b = 1 and d = 0, then UP sets a = 0 (7jt/|TJc3 J. 

5. If a = 1 and d = 0, then UP sets 6 = 0 ("6j/[lJc3). 


Lemma 11. Let fjf be a selection network. Assume that k — 1 inputs are set 
to 1, and rest of the variables are undefined. Unit propagation will set variables 
3/1, • • • ,2/fc-i to 1. 

Proof. From propagation properties of hcomp(a, 6, c, d) we can see that if com¬ 
parator receives two Is, then it outputs two Is, when it receives 1 on one input 
and X on the other, then it outputs 1 on the upper output and X on the lower 
output. From this we conclude that a single comparator will sort its inputs, as 
long as one of the inputs is set to 1. No 1 is lost, so they must all reach the out¬ 
puts. Because the comparators comprise a selection network, the Is will appear 
at outputs yi,..., yk-i- □ 





Smaller Selection Networks for Cardinality Constraints Encoding 


17 


The process of propagating Is we call a forward propagation. For the remain¬ 
der of this section assume that: ff is a selection network; k — 1 inputs are set 
to 1, and the rest of the variables are undefined; forward propagation has been 
performed resulting in yi, ..., yk-i to be set to 1. 

Definition 13 (path). A path is a sequence of boolean variables (zi,...,z m ) 
such that \/i<i< m Zi G )] ani ^ f or oil 1 <i < m there exists a comparator 

hcomp{a, b, c, d) in 4>(f k ) for which Zi G {a, b} and Zj+i G {c, d}. 

Definition 14 (propagation path). Let x be an undefined input variable. A 
path z x = (zi ,..., z m ) (m > 1) is a propagation path, if Z\ = x and (z 2 , ■ ■ ■, z m ) 
is the sequence of variables that would be set to 1 by UP, if we would set Z\ = 1. 

Lemma 12. If z x = {z \,..., z m ) is a propagation path for an undefined variable 
x, then z m = y k . 

Proof. Remember that all yi,... ,y k - 1 are set to 1. Setting any undefined input 
variable a: to 1 will result in UP to set y k to 1. Otherwise ff would not be a 
selection network. □ 


The following lemma shows that propagation paths are deterministic. 


Lemma 13. Let z x = {z\,... ,z m ) be a propagation path. For each 1 < i < m 
and z[ = Zi, if (z[,..., z' m ,) is a path that would be set to 1 by UP if we would 
set z[ = 1, then (z[,.. .,z' m ,) = ( z ,,... ,z m ). 


Proof. By induction on l = m — *. If l = 0, then z[ = z m = y k (by Lemma 121, so 
the lemma holds. Let l > 0 and assume that the lemma is true for zi. Consider 
z\ = zi-i = z m -i- 1 - Set z m -i -i = 1 and use UP to set z m -i = 1. Notice that 
z m -i = z' 2 , otherwise there would exist a comparator hcomp(a , b, c, d), for which 
z m _i _i is equivalent to either a or b and = c and z ' 2 = d (or vice versa). 
That would mean that a single 1 on the input produces two Is on the outputs. 
This contradicts our reasoning in the proof of Lemma El By the induction 


hypothesis (z: 


2 ) ■ 


— ( Z m -i 


^), so 


■ 


l‘) — ( Zm-i- 


1 ) • 


,)• 

□ 


For each undefined input variable x and propagation path z x = {z\,..., z m ) 
we define a directed graph P x = {(.Zj, Zj + i) : 1 < i < in}. 


Lemma 14. Let {x^,... ,Xi t } (t > 0) be the set of undefined input variables. 
Then T = P x U ... U P Xit is the tree rooted at y k . 


Proof. By induction on t. If t = 1, then T = P Xi and by Lemma 12 P Xi ends 
in y k , so the lemma holds. Let t > 0 and assume that the lemma is true for t. 
We will show that it is true for t + 1. Consider T = P x U ... U P x U P x 


By the induction hypothesis T' = P Xi U ... U P Xi is a tree rooted at y k ■ By 
Lemma [l2| V{P Xit+1 ) H V ( T') ^ 0. Let z G V[P Xit+1 ) be the first variable, such 
that 2 Gv(T'). Since z G V(T'), there exists j (1 < j < t) such that z G P Xi ■ 
By Lemma 1 131 starting from variable z, paths P Xit+1 and P X j are identical. □ 
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Graph T from the above lemma will be called a propagation tree. 

Theorem 6. If we set pk = 0, then unit propagation will set all undefined input 
variables to 0. 

Proof. Let T be the propagation tree rooted at yk■ We prove by induction on 
the height h of T, that (*) if we set root of T to 0, then all nodes of the tree will 
be set to 0, thus all undefined input variables will also be set to 0. If h = 0, then 
V = {yk}, so (*) is trivially true. Let h > 0 and assume that (*) holds. We will 
show that (*) holds for height h + 1. Let T' be the propagation tree of height 
h + 1 and let r = 0 be the root. Consider children of r in T' and a comparator 
hcomp(a , 6, c, d) for which r £ {c, d}: 

Case 1: r has two children. The only case is when r = c = 0. Unit propagation 
sets a = b = 0. Nodes a and b are roots of propagation trees of height h and are 
set to 0, therefore by the induction hypothesis all nodes in T' will be set to 0. 

Case 2: r has one child. Consider two cases: (i) if r = c = 0 and either a or & is 
the child of r, then UP sets a = b = 0 and either a or b is the root of propagation 
tree of height h and is set to 0, therefore by the induction hypothesis all nodes 
in T' will be set to 0, (ii) r = d = 0 and either a = c = 1 and b is the child of 
r or b = c = 1 and a is the child of r. Both of them will be set to 0 by UP and 
again we get the root of propagation tree of height h that is set to 0, therefore 
by the induction hypothesis all nodes in T' will be set to 0. □ 

7 Conclusions 

We have constructed a new family of selection networks, which are based on the 
pairwise selection ones, but require less comparators to merge subsequences. The 
difference in sizes grows with k and is equal to n >os ^~ 4 + logn + 2 for k = n/2. 
In addition, we have shown that any selection network encoded in a standard 
way to a CNF formula preserves arc-consistency with respect to a corresponding 
cardinality constraint. This property is important, as many SAT-solvers take 
advantage of arc-consistency, making the computation significantly faster. 

It’s also worth noting that using encodings based on selection networks give 
an extra edge in solving optimization problems for which we need to solve a 
sequence of problems that differ only in the decreasing bound of a cardinality 
constraint. In this setting we only need to add one more clause ~^yk for a new 
value of k. and the search can be resumed keeping all previous clauses as it is. 
This works because if a comparator network is a /c-selection network, then it is 
also a fc'-selection network for any k' < k. This property is called incremental 
strengthening and most state-of-the-art SAT-solvers provide a user interface for 
doing this. 
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